59 research outputs found

    Binarization of Ancient Document Images based on Multipeak Histogram Assumption

    Get PDF
    In document binarization, text is segmented from the background. This is an important step, since the binarization outcome determines the success rate of the optical character recognition (OCR). In ancient documents, that are commonly noisy, binarization becomes more difficult. The noise can reduce binarization performance, and thus the OCR rate. This paper proposes a new binarization approach based on an assumption that the histograms of noisy documents consist of multipeaks. The proposed method comprises three steps: histogram calculation, histogram smoothing, and the use of the histogram to track the first valley and determine the binarization threshold. In our simulations we used a set of Jawi ancient document images with natural noises. This set is composed of 24 document tiles containing two noise types: show-through and uneven background. To measure performance, we designed and implemented a point compilation scheme. On average, the proposed method performed better than the Otsu method, with the total point score obtained by the former being 7.5 and that of the latter 4.5. Our results show that as long as the histogram fulfills the multipeak assumption, the proposed method can perform satisfactorily.

    Enhancement of Iris Recognition System Based on Phase Only Correlation

    Get PDF
    Iris recognition system is one of biometric based recognition/identification systems. Numerous techniques have been implemented to achieve a good recognition rate, including the ones based on Phase Only Correlation (POC). Significant and higher correlation peaks suggest that the system recognizes iris images of the same subject (person), while lower and unsignificant peaks correspond to recognition of those of difference subjects. Current POC methods have not investigated minimum iris point that can be used to achieve higher correlation peaks. This paper proposed a method that used only one-fourth of full normalized iris size to achieve higher (or at least the same) recognition rate. Simulation on CASIA version 1.0 iris image database showed that averaged recognition rate of the proposed method achieved 67%, higher than that of using one-half (56%) and full (53%) iris point. Furthermore, all (100%) POC peak values of the proposed method was higher than that of the method with full iris points.    

    Fine Tuning CNN Pre-trained Model Based on Thermal Imaging for Obesity Early Detection

    Get PDF
    Obesity is a complex disease that causes serious impact health, such as diabetes mellitus, cardiovascular disease, cancer, and stroke. An early obesity diagnosis/ detection method is required to prevent the increasing number of obese people. This study aims to: (i) fine-tune the pre-trained Convolutional Neural Network (CNN) models to build an early detection of obesity and (ii) evaluate the model performance in terms of classifying performance, computation speed, and learning performance. The thermal images acquisition procedure was conducted with 18 normal subjects and 15 obese subjects to build a thermal images dataset of obesity. Pre-trained CNN models: VGG19, MobileNet, ResNet152V, and DenseNet201 were modified and trained using the acquired dataset as the input. The training results show that the DenseNet201 model outperformed other models regarding classifying accuracy: 83.33 % and learning performances. At the same time, the MobileNet model outperformed other models in terms of computation speed with training elapsed time: 12 seconds/epoch. The proposed DenseNet201 model was suitable for implementation as an early screening system of obesity for health workers or physicians. Meanwhile, the proposed MobileNet model was suitable for mobile applications' early detection/diagnosis of obesity

    Studi Komparasi Kinerja Teknologi Near Field Communication Pada Sistem Berbasis Android dan Embedded System

    Get PDF
    Penelitian ini membahas perbandingan kinerja dua perangkat berbasis Android dan Embedded system dalam melakukan pembacaan tag Near Field Communication.  Android dengan modul NFC NXP PN544 dan Embedded system dengan modul MFRC522. Studi kasus yang dilakukan untuk penelitian ini dengan menerapkannya pada manajemen aset. Penelitian ini bertujuan mengetahui kinerja dari kedua perangkat tersebut terdapat perbedaan signifikan atau tidak. Untuk mengetahui hal tersebut, penelitian ini menggunakan metode statistik uji Mann Whitney yang menghitung variabel waktu dan jarak ketika tag NFC berhasil terdeteksi, membaca, dan menampilkan informasi. Dengan metode uji Mann Whitney tersebut dapat diketahui terdapat perbedaan signifikan dari kedua perangkat tersebut berdasarkan hasil yang didapatkan. Setelah dilakukan pengujian, jarak efektif pada kedua alat didapatkan pada jarak 15 mm, 10 mm, dan 5 mm dengan pengujian pembacaan 15 tag NFC pada setiap jarak. Kemudian hasil tersebut diuji dengan metode Mann Whitney pada jarak 15 mm didapatkan nilai signifikasi sebesar .001, kemudian pada jarak 10 mm didapatkan nilai signifikasi sebesar .000, serta jarak 5 mm didapatkan nilai signifikasi sebesar .000 dan dari ketiga pengujian tersebut membuktikan bahwa kedua perangkat terdapat perbedaan signifikan dalam melakukan pembacaan tag NFC

    Effectiveness of MPEG-7 Color Features in Clothing Retrieval

    Get PDF
    Clothing is a human used to cover the body. Clothing consist of dress, pants, skirts, and others. Clothing usually consists of various colors or a combination of several colors. Colors become one of the important reference used by humans in determining or looking for clothing according to their wishes. Color is one of the features that fit the human vision. Content Based Image Retrieval (CBIR) is a technique in Image Retrieval that give index to an image based on the characteristics contained in image such as color, shape, and texture. CBIR can make it easier to find something because it helps the grouping process on image based on its characteristic. In this case CBIR is used for the searching process of Muslim fashion based on the color features. The color used in this research is the color descriptor MPEG-7 which is Scalable Color Descriptor (SCD) and Dominant Color Descriptor (DCD). The SCD color feature displays the overall color proportion of the image, while the DCD displays the most dominant color in the image. For each image of Muslim women's clothing, the extraction process utilize SCD and DCD. This study used 150 images of Muslim women's clothing as a dataset consistingclass of red, blue, yellow, green and brown. Each class consists of 30 images. The similarity between the image features is measured using the eucludian distance. This study used human perception in viewing the color of clothing.The effectiveness is calculated for the color features of SCD and DCD adjusted to the human subjective similarity. Based on the simulation of effectiveness DCD result system gives higher value than SCD

    Improving the Performance of CBIR on Islamic Women Apparels Using Normalized PHOG

    Get PDF
    The designs of Islamic women apparels is dynamically changing, which can be shown by emerging of online shops selling clothing with fast updates of newest models. Traditionally, buying the clothes online can be done by querying the keywords to the retrieval system. The approach has a drawback that the keywords cannot describe the clothes designs precisely. Therefore, a searching based on content–known as content-based image retrieval (CBIR)–is required. One of the features used in CBIR is the shape. This article presents a new normalization approach to the Pyramid Histogram of Oriented Gradients (PHOG) as a mean for shape feature extraction of women Islamic clothing in a retrieval system. We refer to the proposed approach as normalized PHOG (NPHOG). The Euclidean distance measured the similarity of the clothing. The performance of the system was evaluated by using 340 clothing images, comprised of four clothing categories, 85 images for each category: blouse-pants, long dress, outerwear, and tunic. The recall and precision parameters measured the retrieval performance; the Histogram of Oriented Gradients (HOG) and PHOG were the methods for comparison. The experiments showed that NPHOG improved the HOG and PHOG performance in three clothing categories

    Moment invariant-based features for Jawi character recognition

    Get PDF
    Ancient manuscripts written in Malay-Arabic characters, which are known as "Jawi" characters, are mostly found in Malay world. Nowadays, many of the manuscripts have been digitalized. Unlike Roman letters, there is no optical character recognition (OCR) software for Jawi characters. This article proposes a new algorithm for Jawi character recognition based on Hu’s moment as an invariant feature that we call the tree root (TR) algorithm. The TR algorithm allows every Jawi character to have a unique combination of moment. Seven values of the Hu’s moment are calculated from all Jawi characters, which consist of 36 isolated, 27 initial, 27 middle, and 35 end characters; this makes a total of 125 characters. The TR algorithm was then applied to recognize these characters. To assess the TR algorithm, five characters that had been rotated to 90o and 180o and scaled with factors of 0.5 and 2 were used. Overall, the recognition rate of the TR algorithm was 90.4%; 113 out of 125 characters have a unique combination of moment values, while testing on rotated and scaled characters achieved 82.14% recognition rate. The proposed method showed a superior performance compared with the Support Vector Machine and Euclidian Distance as classifier

    Penggunaan Gray Level Co-Occurance Matrix Dari Koefisien Aproksimasi Wavelet untuk Deteksi Cacat Tekstil

    Get PDF
    Pendeteksian cacat tekstil saat ini masih dilakukan secara manualmengakibatkan seseorang sulit mendeteksi lebih dari 60% dari cacat yang ada.Untuk itu, penelitian ini menerapkan metode deteksi cacat tekstil secara otomatismenggunakan Gray Level Co-Occurance Matrix (GLCM) dari koefisienaproksimasi wavelet yang bertujuan untuk mengevaluasi analisis kinerja metode.Tahapannya, sampel citra tekstil dibagi menjadi delapan bagian untukmendapatkan tekstur cacat yang lebih jelas. Bagian tersebut didekomposisikedalam dua level. GLCM dihitung dari koefisien aproksimasi wavelet level satudan dua untuk dijadikan fitur. Penelitian ini dilakukan empat set simulasi citradengan orientasi latar berbeda. Setiap set terdiri dari satu citra noncacat dan duajenis citra cacat. Setiap bagian citra noncacat dihitung jaraknya dengan semuabagian pada citra cacat pertama dan kedua menggunakan jarak euclidean. Hasilsimulasi menunjukkan bahwa GLCM dari koefisien aproksimasi wavelet levelkedua mampu mendeteksi lebih dari 70% dari cacat yang ada

    Features for Cross Spectral Image Matching: A Survey

    Get PDF
    In recent years, cross spectral matching has been gaining attention in various biometric systems for identification and verification purposes. Cross spectral matching allows images taken under different electromagnetic spectrums to match each other. In cross spectral matching, one of the keys for successful matching is determined by the features used for representing an image. Therefore, the feature extraction step becomes an essential task. Researchers have improved matching accuracy by developing robust features. This paper presents most commonly selected features used in cross spectral matching. This survey covers basic concepts of cross spectral matching, visual and thermal features extraction, and state of the art descriptors. In the end, this paper provides a description of better feature selection methods in cross spectral matching

    Ordinal Measure of Discrete Cosine Transform Coefficients and its Application to Fingerprint Matching

    Get PDF
    Recently, the identification system is not limited in using an ID and personal identification number (PIN) but also in using biometriccharacteristics.One of biometric characteristics that has been widely used is fingerprint.This paper proposes a fingerprint matching algorithms using ordinal measure of DCT coefficient. The ordinal measure of DCT coefficient is generated from DCT blocks with size 8x8 pixels. Matching level was determined by computing the Minkowski distance between features of input fingerprint image and fingerprint images in the database. The simulations were accomplished using 128 fingerprints that have been normalized, from which as many as 1024 genuine attempts and 15360 impostor attempts were generated. The proposed algorithms achievedan Equal Error Rate (EER) at threshold 0.3. At the EER, it resulted in FAR value of 0.82%, and FRR value of 78.41% respectively.The low value of FAR showed that the system wasconsiderably secure.DOI:http://dx.doi.org/10.11591/ijece.v3i6.437
    • …
    corecore